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Abstract 

We settle an open problem, raised by Y. Peres and D. Revelle, concerning the L 2 mixing 
time of the random walk on the lamplighter graph. We also provide general bounds relating 
the entropy decay of a Markov chain to the separation distance of the chain, and show that the 
lamplighter graphs once again provide examples of tightness of our results. 

1 Introduction 

Given a finite connected graph G7, a vertex of the lamplighter graph consists of a 0-1 labeling of 
the vertices of G, and a marked vertex of G. Each vertex has a lamp, the marked vertex indicates 
the position of a lamplighter and the labeling at any time indicates the off-on status of each lamp 
(vertex). The lamplighter random walk on G^ corresponds to the lamplighter performing a random 
walk on G, while randomizing the status of each lamp, as he/she visits the corresponding vertex. 
When G is a cycle or a complete graph, the corresponding lamplighter chains were studied by 
Haggstrom and Jonasson [6]. Vertex transitive, other special classes and more general graphs were 
considered in detail by Peres and Revelle [9], who provided general upper and lower bounds for 
mixing times of the lamplighter random walk. 

By tightening the analysis in [9], we prove an optimal upper bound on the L 2 mixing time of 
the lamplighter Markov chain on a class of graphs considered in [9] . The mixing time and related 
measures are defined via 

Definition 1. Let G be a connected d-regular undirected graph. 

• The relaxation time T re i(G) = max^ 1/(1 — |A|) where the maximum is taken over non-trivial 
eigenvalues A of the normalized adjacency matrix of G. 

• t{G) (t2(G)) is the time for the random walk on G to be within 1/4 of uniform distribution 
in total variation distance (L2-distance, respectively). 

• H(G) is expected time it takes for the random walk to travel from x to y, where the choice 
of x and y is adversarial. 

There is a more popular definition of relaxation time (see e.g. [2, 8]). We choose this definition 
as it is easier to work with. Standard inequalities imply that the two numbers differ by at most 1. 

For a graph G, let T2(G^) denote the L 2 mixing time of the lamplighter random walk on G® . 
Then our main theorem is as follows. 
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Theorem 2. Suppose G is a regular undirected graph for which, H(G) < k\G\, for some universal 
constant k > 0. Then there exists a constant c = c{k) such that 

r 2 (G°) <c|G|(T re/ (G) + log |G|). (1) 

The above theorem refines and improves upon (by providing a matching upper bound to) the 
result of Y. Peres and D. Revelle [9], who proved (inter alia) the following theorem. 

Theorem 3 (Peres- Revelle). Under the hypothesis of Theorem 2, there exist constants, ci,c 2 
depending on k such that 

d|G|(T rrf (G0+log|G|) <T2(G<>) <c 2 |G|(r tv (G)+log|G|), (2) 
where Tt v (G) is the total variation mixing time of the simple random walk on G. 

Note that our theorem shows that for G = Z?j, the correct order of magnitude for r 2 (G^) to be 
n2 n , since the relaxation time is of order n. This settles Problem 4 mentioned at the end of [9], 
while our theorem itself settles the question raised as Problem 5 in the affirmative. 

In [9] , the lamplighter random walk on the two-dimensional torus was shown to be an example of 
a chain for which the relaxation time, the total variation mixing time, and the L 2 mixing time were 
all shown to be distinct orders of magnitude. In this paper, we use the one and two- dimensional tori 
as examples which further separate the mixing time in entropy (relative to stationarity) from the 
rest of the above mixing times. These examples also illustrate tightness of the following other result 
of this paper. We show that in general the entropy mixing time is at worst a factor of log log(l/7r*) 
larger than the total variation mixing time for reversible Markov chains (see Corollary 12) below.) 
This is accomplished by relating the relative entropy to the so-called separation distance of a Markov 
chain. 

2 The Lamplighter Result 

In this section we derive some preliminary technical lemmas and a key theorem from which the 
main theorem follows. Since random walks on regular undirected graphs are equivalent to reversible 
Markov chain with uniform stationary distribution, we will use the latter from now on. 

Assume that P is a Markov chain with uniform stationary distribution ir on a finite state space 
X. Let H — m&x X y W, x Ty denote the maximal hitting time (also called the maximum expected first 
passage time) of the chain. Let T re \ denote the relaxation time of the chain, where A is the spectral 
gap of the chain. 

As observed by Peres and Revelle, the L 2 mixing time of the lamplighter graph G^ depends 
upon the moment generating function of the cover time of the underlying graph G. More precisely, 
if St denotes the set of unvisited vertices (by the lamplighter) by time t, then to get convergence 
in the L 2 (or equivalently, in the uniform metric), one needs E2l St l < 1 + e, for e > 0. Our mam 
technical contribution is as follows. 

Let P be a reversible Markov Chain on the state space X with ir as the stationary distribution. 

Theorem 4. Let the chain given by P start in an initial distribution fi so that [i > ir/2. Let the 
maximal hitting time H = H(¥) satisfy H < c\\X\ for a constant c\ > 1. Let 9 > 2 be arbitrary, 
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and let St denote the set of vertices which have not been visited by time t. Then there exists a 
universal constant c such that for all a, b > 0, and for t > t' = C\ \X\T re i \og6 + C% \X\ log \X\, we 
have 

E[0 |St| ] < 1 + S + 5 2 + <5 9 , 

where S = Q-( 1+2a ) T rei \X\~ b , C\ = 2cc\ (1 + a) and C2 = ccf (1 + b). In particular, when a = b = 1, 
we have E[6^] < 1.21. 

Once we have Theorem 4, the main theorem (Theorem 2) follows in a straightforward way, as 
in [9]. We begin with a few simple lemmas. 

Let P be a Markov chain with uniform stationary distribution ir. Let o~\ denote the second 
largest singular value of P. In particular, if P is reversible, then o~\ = max(Ai, |Ajv— 1|), where 
1 = Ao > Ai > • • • > Atv_i denote all the eigenvalues of P. Recall the following basic fact, whose 
proof we include for completeness. 

Lemma 5. Let f : X — * R. Let Xj denote the state of the chain P at time i. Then 

Cov(/(Xi), /(X t )) < o\ Var(/(Xi)) , 
where o~\ is the second largest singular value of TP. 

Proof. Let g = P*/, so that Cov(/(Xi), f(X t )) = Cov(f,g). Hence we have 

Cov(/,<7) = ^[{f -Ef){g-Eg)\ 

= E 7r [{f-Ef)(F t f-EF t f)] 
= E 7T [(f-Ef)((F t -E)f)} 
= E 7T [(f-Ef)((F-E) t f)] 

< ^/E n ([I-E)fYa\ y/VaiM 
= Xl Var^/) 

where we used the fact that the operator norm of P — E is o~\. □ 

Lemma 6. Let {Xt} be a reversible Markov chain on X , with uniform stationary distribution ir. 
Assume that Pt{Xq = x} > n(x)/2. Let T re i be the relaxation time of the chain. Let denote 
the return time to x and assume there are e, 5 > for which 

Pr(T+ > e\X\) > 5 > 

X 

for all x € X . Let y C X be such that \y\ > T re \. Then the "probability of hitting at least 5e\y\/4 
elements of y by time C5~ 2 T re i/Ti(y) is at least 1/2, where C > 16 is an absolute constant. 

Proof. Let r = eT re i/ir(y). For 1 < % < r, let Ij be an indicator random variable for the event 
{Xi 6 y} and Jj for the event {Xi 6 y} and {Xj 7^ Xi} for i < j < r. Finally let J = J2i J% an d 



(EF = E) 

P* - E = (P - Ef 



Note that J is the number of distinct elements of 3^ which have been visited in the time interval 
[1, r\. Also we have Pr{Jj = l|/j = 1} > 5 since r < e\X\. This together with the fact that 
IE[Zj] > ir(y)/2 (due to our assumption on the initial distribution), gives 

E[J] > SE[I] > 5r7r{y)/2 = . (3) 
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To conclude Pr{J > e5T re //4} is bounded away from 0, we bound E[/ 2 ]. 
Fix 1 < i < j < r. From Lemma 5 we have 

E Cov(I h Ij) < E Xt' Var(/ i ) < = T rel Var(/ i ) < T rel M[Ii] . 

Now we have 

E[/ 2 ] < 2 E E Cov ( J - J i) ^ 2 E r re|E[/<] = 2T rei E[7] . 

Since J* < ^ we have E[J 2 ] < E[/ 2 ] < 2T rel E[I] < 2T re iE[J]/5, since E[JJ > <5E[Ii]. Hence 
using Equation 3 we have 

E[J 2 ]< A E[J] 2. (4) 

Now let a be the indicator for the event J > E[J]/2. Then E[J(1 — a)] < E[J]/2 and hence 
E[Ja] > E[J]/2. Now by Cauchy- Schwartz, we have E[Ja] 2 < E[J 2 ]E[a 2 ]. Hence 

r w , r 2n EfJal 2 (E[J]/2) 2 5 2 e 

Thus in a trial of length r = e[A?|T re j/|3^[, the probability that we do not pick up Se\y\/4 elements 
of 3^ is less than 1 — 5 2 e/16. Hence if we repeat this for CS~ 2 /e intervals of length r each, we can 
reduce the probability of failure to less than 1/2. Note that C > 16 here. □ 

Suppose that the initial distribution [i is such that fi > tt/2. Then if the current set of un- 
visited states is large (> T re i) then Lemma 6 shows that we visit Q(T re i) new states within time 
0(T re i/ir(y)) with probability > 1/2. Once the set of unvisited states gets smaller than T re i things 
are in better shape. The next lemma establishes the assumption of Lemma 6 and handles the case 
when the set of unvisited vertices is small. 

Lemma 7. Let ¥ be a Markov chain with uniform stationary distribution and maximal hitting time 
H . For x € X , let T% denote the expected length of the return time to x. Then 



minPr(r 7 f > — \ > — . 
x x\ x - 2 / ~ 2H 

Also, for any y C X, with probability > 1/2, we visit at least \y\/2 elements of y by time 4H. 

Proof. Since the stationary distribution is uniform, E X [T+] = \X\. If after \X\/2 steps we have not 
yet returned to x, and are currently at state y, then we expect to visit x within another H steps. 
Hence 

\X\ = E X [T+] < Pr{T+ < \X\/2}\X\/2 + Pi^T 7 > \X\/2}H . 

Rearranging terms, we get the result. 

For the second result: Fix x € 3^ and let H x denote the time when x is visited. E[H X ] < H. 
Thus by time 4H we visit x with probability > 3/4. 

Let Y denote the number of elements of 3^ which have been visited by time 4H. Then E[Y] > 
3\y\/4. If q = Pr{Y > \y\/2}, we have 

3\y\/4<E[Y]<(l-q)^ + q\y\. 
Solving for q gives q > 1/2. □ 
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Proof of Theorem 4. Let r = \\X\/T re i\ and for i = 0, 1, . . . ,r — 1, let ki = — iT re i and for 
i = r, k r = 0. Define stopping times Tj as the time when |St| = ki for the first time. 

From Lemma 7 and Lemma 6, it then follows that Tj — Tj_i is stochastically dominated by on = 
j/kiZi where Zj is geometric with mean 2 and 7 = C ■ K 2 \X\T re i and C > 16 is a universal constant. 

Fix t > 0, i < r and (3 > be arbitrary, Then 

i . * 

Pr{T, >t} = Pr{^ a, > t} < Pr{^ ^Z* > t} 

j'=l 3=1 * 

exp(-t/3)E[^^Z i ] <exp(-</3)nE[^Zi 



< 



3=1 



3=1 



Choose /3 so that /? = ki/3j so that 7/3 < kj/3 for all j < i. For a < 1/3, E[aZj] < exp(3a). This 
gives 

"7/3, 



E 



< exp(kifkj) . 



Hence we have 



A. 1 



Pr{Ti >t}< expf-i-i- + V tM < exp(-t-i + -i- log \X 
V 37 f-f kj J V 37 T re/ 



ki ki 



37 ' f-f kj 

3=1 J 

For i = r, Lemma 7 implies (T r — T r _i) is stochastically dominated by the sum of £ = log 2 (2T re /) 
independent geometric random variables with mean 4K\X\. Applying a Chernoff bound we get 



Pr{T r - T r _i >t}< exp 



t 



4JC\X : 

Breaking the values of \St\ into intervals of size T re i we have 



+ ^log( 



t 



AK\X\ 



(5) 



E[0 |St| ] < 1 + 6 ki+Trel Pr{|5*| > h} = 1 + ^ 6» fcl+Tl - ei Pr{Tj > i} 



i=0 



i=0 



For i < r, k{ > T re i and hence 



0fc<+T rei Pr{Ti > t} < exp / (A;i + r j log0 _ A + A. log |Af 

v 37 T re / 

<exp(2fcj log 9 + A. log \X\ - t£- 



(6) 



When i = r, = k r < T re / < fc r -i and hence 



6) fcr+T re! Pr | T) _ > t } < gT rel (PrjT^i > t/2} + Pr{T r - T r _i > t/2}) 



< exp — 



t k 



r— 1 



+ 



log|Af 



2 37 T rd 
+ 6» Trei exp f I - n J t — + t log / 



(7) 



8iT|Af| 
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where I = log 2 (2T re z). 

Let t' = QCK 2 \X\ (2(1 + a)T re i log 9 + (1 + b) log | *|) for any a, 6 > and hence take c = 6C. 
now show that for t > t' , E[#l s *'l] — 1 is small. Recall that 7 = CK 2 \X\T re i, hence we have 

(- = 4(1 + a) log + 2(1 + 6)^1 

^ (2(1 + a)T rel log + (1 + b) log |*|) 



8i<C|*| 4 
,/ , / g \ . , ( K}- + a)CK log 6-2Xrel \ 
1 := Msi^i " l0g l 4 J ~ l0g(6 ' (2Tj - ei)) ' 

since C > 16, K > 1, 6> > 2. 

Now for t > t' and i < r, Equation 6 reduces to 

e k i+ T rel p r{T . > ^ < exp f 2 ki \ogO + p- log |*| - 4(1 + a)ki \og6- 2(1 + 6) A. log |*| 

_ g-(2+4a)fc i |^|-(l+2fe)fc i /T r . ei _ 

And Equation 7 reduces to 

e r rel Pr | T . > t /| < 9 T rel exp ^_ 2 (i + a )^ r _ 1 i og _ (1 + log* + ^=1 log* 

+ T <-< exp(^-exp(£') + ^' 



< exp(-(l + 20)^-1 log - 6^ log |*| 

V J-rel 

+ expfr rei lo g + (1 + £')(£' - log(6)) - exp(f ) 



using I = log 2 (2T re ;) and £' > £ + log(6). Here we also use the fact that f(x) = exp(x)/4 
x)(x — log(6)) > for all x > 0, and k r -\ > T re i. 

We now have 

QTrel P r {Tj > t'} < 6)-(l+2a)T re; |^|-6 

+ exp(r rei lo g 9 - ^(2(1 + a)T re/ log0 + (1 + b) log |*|)) 

< e-Q+WTr* |^| -6 + exp ^_9(i + 2a)T re/ log - 9(1 + 6) log |*| 

< ^-(1+20)^1^1-6 + 0-9(1+20)2^ |^| -9b ; 

using C > 16. 

Combining Equation 10 and Equation 8 we have 
r-l 

E[6»l S t'l] < 1 + y^5/-(2+4o)fc i | < Y|-(l+26)fc i /T,. ej + 0-(l+2a)T r . e! | A ,|-6 + -9(l+2a)T rej | A ,|- 



j=0 
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Now let 7] = 0-(i+2a)T reI> Using Av-i > iT rei , we get 

EpISfl] < 1 + £ ^(r-Oj^l-^Cr-i) + + 
i=0 

Summing up the geometric progression and simplifying we get 

E[6>l s ''l] < 1 + V 2 \X\~ 2b + rj\X\- b + if\X\^ b . (12) 

Observe that by setting a = b = 1 and using 9 > 2,T re / > 1/2, | > 2, we get 77 < 2~ 3 / 2 and 
I^T 6 < 1/2. Finally, substituting in Equation 12 we have E[6>l s *'l] < 1.21 . □ 

Proof of Theorem 2. Implicit in the proof of Theorem 1.4 of [9] is that for a suitable constant 
C > 0, 

r 2 (G <> ) < C (r 2 (G) + nun |e[2I 5 *I] < 2}) . (13) 

Since G is undirected, the random walk on G is reversible. Hence after time 4t(G), the distri- 
bution n of the lamplighter's position satisfies fj, > ir/2. Since the random walk on G is reversible, 
we have (e.g., by [3]) that the mixing time is bounded above by maximal hitting time H(G). Thus 
by running the Lamplighter chain for an initial 0{H{G)) steps, we can ensure that the assumption 
of Theorem 4 holds. 

Hence Theorem 4 implies E[2l 5 *l] < 2 for t = 0{H(G)) + 0(\G\ ■ (T rel + log \G\)). Thus (13) 
gives 

r 2 (G < >) = 0(r 2 (G) + \G\-(T rel + log\G\)), (14) 

since H(G) < k\G\. Finally, the regularity of G implies that the stationary distribution of the 
random walk on G is uniform. Thusr2(G) = 0{t{G) log |G|) implying that r 2 (G) = 0(|G| log |G|).D 



3 Separation Distance and Entropy Decay 

Recall that Pinsker's inequality lets one bound the total variation mixing time of a Markov chain 
from above by the entropy decay time, up to an absolute constant. In this section, we show that 
for reversible Markov chains, the time for the relative entropy to decay to within 1/e is no larger 
than log log (1/71"*) times that of the total variation mixing time. We actually prove a more general 
result for all Markov chains from which the above will follow under the additional assumption of 
reversibility. 

Once again let P be a Markov kernel with stationary distribution ir on a (finite) state space X. 
First recall the definition of separation between a chain at time n > and tt. 

Definition 8. For n G N and x G X, set 

/ P n (x, y)\ 

sepp(x, n) = max I 1 -r4r — I , sep P (n) = max sep P (x, n) . 

yex V 7r(y) ' 

Also set 

d tv (P"(x,-)-vr)= Yl (n(y)-f n (x,y)), 

yeTr(y)>F n (x,y) 
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""(s.OMO - 1|| 2 = (j2{V n (x,y)/<n(y) - l) 2 vr(y) 



1/2 



j/G* 



When understood from the context, we drop the subscript in sep P . Recall that the function 
n i — > sep(n) is non-increasing and sub-multiplicative (see [1, 2] for more details.) It is well-known 
and is easily seen that d tv (F n (x, •) — vr) < sep(x, n): 

d tv (F n (x,-)-7r) = (Ay)-w n (x,y)) 

yeTr{y)>P*>(x,y) 

< sep(x, n). 

Thus separation bounds total variation. Now we observe that it also controls entropy decay up 
to a factor of log(l/7r*). Recall that the relative entropy, denoted by D (/u 1 1 1^) , of a distribution 
with respect to v is defined as 

x 

where as usual log = 0, and \x is assumed to be absolutely continuous with respect to v (meaning, 
v{x) = implies that n(x) = 0.) It is well-known (see e.g. [4]) that D([i,\\v) < log(l/^), where 
= min 2; z/(x), and that -D(-||^) is convex in the sense that, for < a < 1, and for m, /i 2 
probability distributions (absolutely continuous with respect to v\ 

D(a/ii + (1 - a)n2\\v) < aD(/xi||i/) + (1 - a)D(/i 2 ||z/) • 

Proposition 9. 

D(P"(x,y)M < sep(x,n)log(l/7r,). 

Proof. Let sep(x,n) = e > 0. Then F n (x,y) > (1 - e)ir(y), for all x,y G X. Let 

fi(y) :=(l/e)[P n (x,y)-(l-e)7r(y)], for y 6 X . 

Then \l is a probability distribution on X and P n (x, •) = (1 — e)n + e/i . (Note here that \i implicitly 
depends on x.) By the convexity mentioned above, 

D(F n (x,y)||7r) < eD(yu||7r) < e log(l/^), 

hence the proposition. □ 

Definition 10. For < e < 1/2, let the entropy decay (mixing) time be 

Tent(e) = min{n' : n > n' =>■ max D(P n (x, ■) ||7r) < e} . 

and similarly define the other mixing times r s , r tv , and r 2 with respect to sep(x, n), dt v (P n (x, •)— n), 
and ||P n (rr, ■)/ 7r (") ~~ 1 1 j 2 , respectively. 

It then follows immediately from the above proposition, that: 



S 



Corollary 11. 

Tent(e) < r s (l/e)[loglog(l/7T*) +log(l/e)] . 

It is known that t s = 0(rt v (F) + Tt v 0P*))) where P* denotes the time-reversal of P. Hence the 
assertion claimed at the beginning of this section follows. Note that the lower bound below does 
not need reversibility, and uses the general inequality (known as Pinsker's) for two distributions, ji 
and v, one has: 

2d 2 tv (fi-v) <D(ji\\u). 
Corollary 12. If TP is reversible, then 

T tv (e/2) < r c „t(e) < Cr tv (l/2e) [loglog(l/^) + log(l/e)] , 
for C > an absolute constant. 

Remark 13. Note that the above result shows that the entropy decay time is in general closer to r tv 
than to T2, since there can be a factor of log(l/7r*) between T tv and ti- Similarly this indicates that 
in general po of a reversible Markov chain is closer to the spectral gap than it is to the logarithmic 
Sobolev constant p. 

We now show that the log log 1/7T* gap in the corollary cannot be improved. While a random 
walk on the complete graph (with self-loops) can be shown to establish this, we proceed with the 
following more robust example, which also separates various other mixing times. 

Example 1. Consider the following lamplighter chain on a discrete circle of size n. Unlike the usual 
lamplighter walk, in this chain each vertex of the circle has an m-state lamp for some parameter 
m. However, every time a vertex is visited, the lamplighter completely randomizes the lamp. The 
(discrete) mixing time of the chain is still related to the time it takes to visit all vertices of the base 
graph. In particular, it is easy to see that the total variation mixing time is the expected cover 
time, i.e. 0(n 2 ) and is independent of m. From our result above, it then follows that the entropy 
mixing time of this chain is 0(n 2 log log N) where N = m n . Thus we have 

Tent = 0(n 2 ■ (log n + log log to)). 

Suppose we start the chain at vertex x of the circle and all lamps are in state 0. Let A denote 
the semi-circle consisting of vertices which are at distance n/4 or larger from x. 

There exists an absolute constant c > such that for all a > 0, the probability that a random 
walk on the circle has not touched A after an 2 steps is > exp(— ca). Thus after on 2 steps, the 
entropy is at least exp(— ca) |A| logm since the entropy of the product distribution is the sum of 
the component entropies and the entropy of each non-random lamp is logm. Thus in order for 
this chain to mix in entropy we must have a = O (log (n logm)). Hence r cnt = 0(n 2 log(?ilogm)) 
matching the upper bound given by Corollary 12. 

Now let us look at the L 2 mixing time. In this case, we need to bound the L 2 distance of a 
product space in terms of the independent component L 2 distances. Since 

||/i t / 7 r-l|| 2 |7r = E 7r ( / it/7r) 2 -l, 
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it follows that for t = an 2 , we have 

1 + V&T n (ut/ir) > exp(— ca) rnj = exp(—ca)m n ^ 2 . 

This gives T2 = ft(n 2 log N). 

We observe the following from the above example: 

• If the number of states is N = m n , then we have a G(log N) gap between the variation and 
L 2 mixing times. 

• [9] shows that the relaxation time of Lamplighter chains (with 2-state lamps) equals the 
maximal hitting time of the base chain. The f2(log N) gap between n v and t<i also shows that 
in this m-state lamp case, the relaxation time of the chain is 0(n 2 ). 

• Tent = Q(Ttv log log N) here as well. 

• Finally that we have a chain where the variation, entropy and L 2 -mixing times are all different 
orders of magnitude. 

Note that in the above case the variation mixing time and the relaxation time are of the same 
order of magnitude. To separate these, we need to separate the maximal hitting time and the 
expected cover time of the underlying chain. So it is natural to consider, the m-state lamplighter 
chain on the two-dimensional torus of size n. Recall that for this case of the torus, the maximal 
hitting time is O(nlogn) and the expected cover time is 0(nlog 2 n). To show that the entropy 
time is still separated from the other times, observe that there exists an < a < 1 such that, for 
a = a(n), 

Pr [number of vertices not visited in an log n steps > n a ] > e~ ca , 

where c > is a constant. Now the argument follows as in the one-dimensional case above, and 
gives T cnt = fi(nlog(n log m)). 

4 Questions 

Suppose P, Q are Markov Chains on state space X, y respectively. The Lamplighter chain Q?P has 
state space y x x X , i.e. a configuration of lamps / together with the position of the lamplighter. 
[9] considered y = Z2 and Q on y which completely randomizes the lamp in one step. 

In [5] it is shown that the L2-mixing time of the lamplighter chain on Q I P is related to the 
following generalization of the moment generating function considered in this paper. 

Definition 14. Let P be a Markov Chain on state space X and let 7 > 0. For S > 0, let 7? s denote 
the number of states that have been visited fewer than 7 times up till time S and let Cs(@) = 1E[^ Z ^]- 

Quantities similar to ZX, with first moment computations, were considered in [10] and [7]. The 
first paper to consider moment generating functions of Markov chain related quantities seems to 
be [9]. 
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Question 1. For 9 > 0, S > 0, find bounds on 

F(F,9, 1 ,5)=mf{S:Q(0)<l + 5}. 

For estimating the mixing time of Q I P, where Q is a Markov chain on y, the quantity of interest 
is F(P,p;|,T 3 (Q,e/|*|),e). 

• If 7 > \X\ log#, then it is enough to take S = 0(7|Af|). 

• If P mixes in one step then it reduces to the coupon collector problem and hence for any 
7 > 0, it is enough to take S = 0((j + log |Af|)|Af[). 

• In general, 0((7 + log \X Dl^lTty) is enough, where T tv is the variation mixing time of P. 

Conjecture 1. Let P be reversible with uniform stationary distribution and maximal hitting time 
H = 0(\X\). Then 

F(F, 6, 7 , 5) < C ■ | AT| ( 7 + T rel + log \X\) , (15) 
for some absolute constant C. 
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